Talend Big Data Basics
Subscription: This content is available for Talend Academy subscription users.
Instructor-led: This content is available as instructor-led training.
Talend provides a development environment that enables you to interact with many big data sources and targets without having to understand or write complicated code.
Talend Big Data Basics introduces the Talend components for interacting with big data systems. These components ship with several products.
Duration: 2 days (14 hours)
Target audience: Anyone who wants to use Talend Studio to interact with big data systems
Prerequisites: Completion of Introduction to Talend Studio, Talend Data Integration Basics, or Talend Data Integration Advanced
Badge: Complete this learning plan to earn the Talend Big Data Developer Practitioner badge. To learn more about the criteria for earning this badge, refer to the Talend Academy Badging Program page.
Learning objectives: After completing this learning plan, you will be able to:
- Create cluster metadata
- Create HDFS and Hive metadata
- Connect to your cluster to use HDFS, HBase, Hive, Pig, and MapReduce
- Read data from and write it to HDFS (HDFS, HBase)
- Read tables from and write them to HDFS (Hive)
- Process tables stored in HDFS with Hive
- Process data stored in HDFS with Pig
- Process data stored in HDFS with Big Data Batch Jobs
Training modules: To complete the learning plan, take the following training modules: